Microsoft Word - manuscript

نویسندگان

  • Qiang Yu
  • Hongwei Huo
  • Jeffrey Scott Vitter
  • Jun Huan
  • Yakov Nekrich
چکیده

In recent years, there has been an increasing interest in planted (l, d) motif search (PMS) with applications to discovering significant segments in biological sequences. However, there has been little discussion about PMS over large alphabets. This paper focuses on motif stem search (MSS), which is recently introduced to search motifs on large-alphabet inputs. A motif stem is an l-length string with some wildcards. The goal of the MSS problem is to find a set of stems that represents a superset of all (l, d) motifs present in the input sequences, and the superset is expected to be as small as possible. The three main contributions of this paper are as follows: (1) We build motif stem representation more precisely by using regular expressions. (2) We give a method for generating all possible motif stems without redundant wildcards. (3) We propose an efficient exact algorithm, called StemFinder, for solving the MSS problem. Compared with the previous algorithms, StemFinder runs much faster and first solves the (17, 8), (19, 9) and (21, 10) challenging instances on protein sequences; moreover, StemFinder reports fewer stems which represent a smaller superset of all (l, d) motifs. StemFinder is freely available at http://sites.google.com/site/feqond/stemfinder.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Author's response to reviews Title:Few additional genetic mutations accumulate during metastatic progression in high-grade serous ovarian cancer Authors:

1. Line NumberingPlease revise your manuscript to include line and page numbers. Authors are asked to ensure that line numbering is included in the main text file of their manuscript at the time of submission to facilitate peer-review. Once a manuscript has been accepted, line numbering should be removed from the manuscript before publication. For authors submitting their manuscript in Microsof...

متن کامل

Microsoft Word - JBC manuscript 09-1-13

Background: The functional importance of C2 insert containing isoform of nonmuscle myosin II-C is not known.

متن کامل

Microsoft Word - manuscript rev2 v2

Background: -synuclein is an aggregation-prone protein which reconfigures more slowly under aggregating conditions. Results: Curcumin binds to monomeric synuclein, prevents aggregation and increases the reconfiguration rate, particularly at high temperatures. Conclusion: Curcumin rescues the protein from aggregation by making the protein more diffusive. Significance: The search for aggregatio...

متن کامل

Avoiding ethical temptations

In most cases authors are permitted to post their version of the article (e.g. in Word or Tex form) to their personal website or institutional repository. Authors requiring further information regarding Elsevier's archiving and manuscript policies are encouraged to visit:

متن کامل

Some problems, I care most

In most cases authors are permitted to post their version of the article (e.g. in Word or Tex form) to their personal website or institutional repository. Authors requiring further information regarding Elsevier's archiving and manuscript policies are encouraged to visit:

متن کامل

Multi-armed spirals and multi-pairs antispirals in spatial rock–paper–scissors games

In most cases authors are permitted to post their version of the article (e.g. in Word or Tex form) to their personal website or institutional repository. Authors requiring further information regarding Elsevier's archiving and manuscript policies are encouraged to visit:

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013